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We explore the possibility of very long-lived gravitational-wave transients (and detector artifacts) 
lasting hours to weeks. Such very long signals are both interesting in their own right and as 
a potential source of systematic error in searches for persistent signals, e.g., from a stochastic 
gravitational-wave background. We review possible mechanisms for emission on these time scales 
and discuss computational challenges associated with their detection: namely, the substantial volume 
of data involved in a search for very long transients can require vast computer memory and processing 
time. These computational difficulties can be addressed through a form of data compression known 
as coarse-graining, in which information about narrow frequency bins is discarded in order to reduce 
the computational requirements of a search. Using data compression, we demonstrate an efficient 
radiometer (cross-correlation) algorithm for the detection of very long transients. In the process, 
we identify features of a very long transient search (related to the rotation of the Earth) that make 
it more complicated than a search for shorter transient signals. We implement suitable solutions. 


I. INTRODUCTION 

Previous work has explored the astrophysics of (and as¬ 
sociated detection strategies for) long-lived gravitational- 
wave transients lasting «10-T000s; see, e.g., ehui and 
references therein. Most long-transient models rely on 
the emission of gravitational waves from rotational in¬ 
stabilities in a protoneutron star or its accretion disk but 
other models are possible as well |I[. Initial LIGO/Virgo 
have performed a search for unmodeled long lived tran¬ 
sients associated with long gamma ray bursts yielding 
upper limits on gravitational-wave fluence [12]. 

Despite this progress, there is still a largely unexplored 
region of parameter space of “very long-lived” transient 
signals lasting hours to weeks; (although, see 0 )- As we 
discuss below, it may be possible for neutron stars to emit 
transient gravitational waves on these time scales. More¬ 
over, exotic models allow for the possibility of a seem¬ 
ingly persistent signal to start or stop during an observ¬ 
ing run 0, also potentially leading to very long tran¬ 
sient signals. Thus, there is astrophysical motivation to 
carry out a search for very long lived gravitational wave 
transients. An efficient very long transient detection al¬ 
gorithm will have other useful applications: it can estab¬ 
lish if an apparently persistent source, e.g., observed in a 
stochastic background search fl5| . exhibits variability in 
time, and it can be used to understand the behavior of 
detector artifacts on timescales of days to weeks. 

Our goal here is to outline a computationally feasible 
search for very long-lived transients employing radiome- 
try [[ff| • We review the principles of radiometry below in 
Section ED but for now, we note that radiometry relies 
on the cross-correlation of two or more detectors in order 
to detect gravitational waves as excess coherence. Ex- 
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cess coherence can be identified through various pattern 
recognition techniques. For demonstrative purposes, we 
here employ a “seedless clustering” algorithm [10, 1 ll ]. 
discussed in greater detail in Section m though, other 
pattern recognition techniques may be used as well. 

In order to carry out such a search, it is necessary 
to first address computational challenges that appear, at 
first glance, to make the search daunting. Namely, dif¬ 
ficulties arise due to the fact that only so much uncom¬ 
pressed spectrographic strain data can be stored at once 
in random access memory (RAM). Without compression, 
it is impossible to hold many hours of data in memory at 
once on a typical computer. In theory, data can be read 
in as needed, but this creates an input/output bottleneck. 
Also, in searches that rely on seedless clustering 00, 
the computation time required to sum hours of uncom¬ 
pressed spectrographic data can become prohibitive. 

The solution we propose is an extra pre-processing 
step in which data are compressed in order to facilitate 
computationally feasible analyses. Through a procedure 
known as coarse-graining (alr eady used in searches for 
the stochastic background [17|), the volume of data can 
be reduced by orders of magnitude so that days of data 
can be represented with a spectrogram comparable to one 
used to represent minutes of uncompressed data. Coarse- 
graining necessarily discards information, but through 
this procedure, we may study much longer transient sig¬ 
nals than otherwise possible. 

The remainder of this paper is organized as follows. In 
Section [Til we discuss the motivation for searches for very 
long transients. In Section mu we review key principles 
of radiometry needed for subsequent discussion. (Ad¬ 
ditional details are included in the Appendix.) In Sec¬ 
tion IIV1 we describe compression procedures for coarse- 
graining, and in Section El we demonstrate the recovery 
of very long-lived signals using compressed data. In Sec¬ 
tion ED we summarize our results and suggest some lines 
of future research. 
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II. MOTIVATION 

The scientific rationale for a search for very-long tran¬ 
sients, spanning hours to weeks, was first explored in 0 . 
The authors of (TTj} | review a number of somewhat spec¬ 
ulative scenarios associated with neutron stars including 
gravitational-wave emission lasting days to months from 
non-axisymmetric Ekman flow following a glitch fl~8l - |20j | , 
Alfven oscillations from gian t magnetic flares (also last¬ 
ing days to months) j2j, [22| , emission from free preces¬ 
sion (with a damping time possibly lasting from weeks to 
years) (23l - (25l ] , magnetic instabilities in newborn neutron 
stars (lasting days) [26j . and gravitational-waves from r- 
modes [27l.l28j. 

Generic rotational instabilities in newborn neutron 
stars, potentially powered by fallback accretion @, 0, 
may persist on a timescale of hours 0] ■ Somewhat more 
speculatively, we note that observations of intermittent 
pulsars, which become quiescent on timescales of days 
(e.g., [30(), suggest that neutron star dynamics vary on 
the timescales considered here, motivating exploration of 
this region of parameter space. Similarly, variability in 
accretion on these timescales may affect gravitational- 
wave emission from accretion-supported mountains [3ll ]. 
Finally, it is worthwhile to be prepared for a surprise: 
a very long lived transient signal from an unexpected 
source. Recent work proposing gravitational-wave emis¬ 
sion from gravitationally bound axion clouds 0 , po¬ 
tentially starting and stopping on the timescale of a few 
years, serves to illustrate this possibility. The method 
we propose below is also applicable to quasi-infinite sig¬ 
nals turning on or off during an observing run as well as 
repeating sources with long-lasting emission periods. 

An algorithm for identification of very long transients 
also provides a means of understanding the time depen¬ 
dence of apparently persistent signals, e.g., in a stochastic 
background search. In the event of an ostensibly persis¬ 
tent gravitational-wave detection, it is prudent to inves¬ 
tigate if the temporal behavior of the signal is consistent 
with the assumed signal model. Matched filter searches 
for rotating neutron stars, for example, assume that the 
strain is constant over the duration of the measurement. 
An observation of non-trivial time-dependence could be 
evidence of new physics. 

In addition to the astrophysical motivation, the al¬ 
gorithm provides useful information for detector char¬ 
acterization. Recent work has shown that geophysical 
effects such as global Schumann resonances can induce 
correlated noise in worldwide networks of gravitational- 
wave detectors in a way that can bias cross-correlation 
searches [32, 33j. Some geophysical phenomena (such 
as electromagnetic activity induced by solar flares) ex¬ 
hibit temporal fluctuations on very long timescales. Un¬ 
derstanding the time dependence of excess cross correla¬ 
tion can help to rule out (or lend credence to) different 
explanations of an apparent signal in searches for per¬ 
sistent gravitational waves in both targeted radiometer 
searches [HI and searches for stochastic backgrounds 0. 


In order to see how very long signals might escape de¬ 
tection without a dedicated search, we note simple scal¬ 
ing behavior. Radiometric signal-to-noise ratio (SNR, 
defined below; or see Eq. 3.11 of pj) scales like 

SNR«C*obs> (1) 

where i is the observation duration and £ is the frac¬ 
tion of the observation during which the signal is present. 
To be concrete, let us consider an Rill hr-long signal with 
SNRtot = 40. While this signal is exceptionally loud, it 
can be significantly diluted if we search for it assuming 
a persistent signal model. For example, if we integrate 
over the entirety of a 90 day science run, the signal-to- 
noise ratio will be diluted by a factor 


(1)\/11 hr 1 90 days 

(11 hr/90 days)v / 90 days V 11 hr 

so that SNRtot is reduced from 40 to ~3, which is not 
statistically significant given a few thousand indepen¬ 
dent frequency bins. It follows that a very long transient 
signal, loud enough to affect a radiometer or stochastic 
search, should produce an easily identifiable signal using 
an algorithm dedicated to detection of very long transient 
phenomena. 

If we attempt to resolve the signal using 1 s segments, 
the signal-to-noise ratio is diluted by a factor of 


(l)VTThr 

(i)vTs 


200 , 


so that SNR 0.20 in each segment. This back-of- 
the-envelope argument illustrates how even exceptionally 
loud very long signals can escape detection without a ded¬ 
icated search. 


III. RADIOMETRY 

The basic idea of radiometry is to cross-correlate data 
from two or more detectors. By integrating over as much 
data as possible, it is possible to identify weak sig nals 
buried in noise. Drawing on previous results 0 , 1130 , 
we take as our starting point spectrograms of cross- 
correlated data. Details are provided in Appendix [H 
For the sake of simplicity, we focus here on a two-detector 
network, but note that the algorithm generalizes straight¬ 
forwardly to arbitrarily many detectors [lj. The cross- 
correlated data can be represented as a complex-valued 
signal-to-noise ratio 0 (see Eq. IA4I in Appendix [A} : 

p(f;/) = Ksl(t; f)s 2 (t; f) j \J P{(t; f)P^{t;f). (2) 

Here, sj(f;/) is the discrete Fourier transform of the 
strain series from detector /, P'(t;f) is the auto-power 
spectrum calculated using neighboring data segments, 
and k is a normalization factor; (see Eq. IA5I in the Ap¬ 
pendix). Each segment has duration 5t. The choice of St, 
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which we examine in greater detail below, is determined 
both by astrophysical considerations and computational 
limitations. The argument t is the mid time of the data 
segment and / is frequency. Note that p(£;/) does not 
depend on the direction of the source Cl because the di¬ 
rection of the source is encoded as a complex phase. 

If the data consist of well-behaved noise, both the real 
and imaginary parts of p are characterized by distribu¬ 
tions with mean=0 and variances 1. Gravitational-wave 
signals induce excess p with a complex phase angle T 
determined by the direction of the source and its fre¬ 
quency /»: 


T; = —2-KfiCl ■ A x/c 


( 3 ) 


Here, Ax is the difference in detector position and c is the 
speed of light. For the LIGO detectors, |Ax|/c ~ 10 ms. 
Thus, a signal can, e.g., induce real negative or positive 
imaginary values of p. Note that is time-dependent due 
to the rotation of the Earth. This behavior is illustrated 
in Figs, [iji and [[]:), which show spectrograms of the real 
and imaginary parts of p for data consisting of Advanced 
LIGO [36(, Gaussian, Monte Carlo noise at design sen¬ 
sitivity plus a persistent monochromatic signal with fre¬ 
quency / = 100 Hz and strain amplitude ho = 1 x 10 -22 , 
located at (ra, dec) = (18.5 hr,+39°). 

These spectrograms use (St, A/) = (158 s, 1 Hz) reso¬ 
lution and span two days of data. We utilize a coarse- 
graining procedure described below. The repeating pat¬ 
tern of light and dark at 100 Hz repeats with a period 
of one sidereal day. This pattern, which varies with dec¬ 
lination and frequency, shows how the phase factor Hq 
(Eq. [3]) varies with the rotation of the Earth. It is inter¬ 
esting to note that these horizontal stripes are a distinct 
but analogous effect to the vertical stripes (also arising 
from radiometer phase mismatch) described in 0 - 
If we multiply p by (an array of) the phase factor 
exp (— i'&i) in order to “point” the radiometer map in 
the source direction (and reduce the injected strain am¬ 
plitude to ho = 4 x 10~ 24 ) we obtain Fig. QJ. With the 
appropriate phase factor applied, the cross-power due to 
signal is real and positive. However, there is still a time- 
dependent modulation with a period of one sidereal day, 
owing to the time dependence of the detectors’ antenna 
response; see Eq. IA2I Fig. [TJI shows the signal recov¬ 
ered using a seedless clustering algorithm 11| to look 
for tracks of bright pixels. This recovery is described in 
greater detail in Section [V] 

In order to carry out a search for a transient signal as¬ 
sociated with some direction H, we construct a detection 
statistic (see Eq. IA9I in Appendix [A}: 


SNR tot (f2) = 


Re [E ie r ex P H^i) Pi ^ 12 ] 


Sier ( e i; 


1/2 


( 4 ) 


The ej 2 (H) factor describes the efficiency of the detec¬ 
tor pair [1| (see Eq. IA2I in Appendix 0. The sum over 


i runs over some set of (£; /) pixels T, which is deter¬ 
mined by details of the search. A search for persis¬ 
tent narrowband signals, for example, might sum over 
time at a fixed frequency. In searches for transient sig¬ 
nals, the set of (t; f) pixels may be described, e.g., by 
a Bezier curve [10, HH, [33] - In the analysis that follows, 
we employ clustering with parametrized quadratic Bezier 
curves, each with a different randomly chosen direction. 
This family of curves includes straight lines as a subset. 
For fixed Cl, ej 2 (fi) varies as the Earth rotates. While this 
effect is usually not important on timescales of <1000 s, 
it plays a significant role for very long signals. Finally, 
note that Eq. [4] assumes (for the sake of compact nota¬ 
tion) that the noise is approximately stationary and that 
the signal does not vary significantly with frequency; see 
Eq. IA7I These assumptions can be straightforwardly re¬ 
laxed using Eq. IA6I 


IV. COMPRESSION 
A. Coarse-graining 

In order to facilitate the search for very long-lived sig¬ 
nals, it is desirable to compress the cross-correlation data 
used in long transient searches, e.g., 0 - Past analyses, 
targeting signals with duration ss 10-1000 s, utilized (50% 
overlapping) 1 s-long segments. While this does not pose 
a significant computational challenge for the analysis of a 
1500 s-long window (3000 segments), it is not practical for 
the analysis of 10hr = 36000s signals (72000segments). 
First, it is currently impractical to load the data from so 
many segments into RAM. Second, even if we could get 
around the memory problem, it would take an inordinate 
amount of computation time to treat so many segments 
separately. 

One solution is to average data from neighboring fre¬ 
quency bins in order to compress the data into a for¬ 
mat that preserves the information needed to search 
for very long signals while averaging away details about 
short timescales. This compression, known as “coarse- 
graining,” enforces the assumption that the signal varies 
slowly enough that the change in frequency as a func¬ 
tion of time is small compared to the bandwidth of each 
spectrogram pixel. Previous radiometer analyses [H, |38} 
have already employed a form of coarse-graining as part 
of a standard data conditioning procedure, but in these 
cases, the end data product is a single spectrum with no 
time-dependence (as opposed to a spectrogram), and so 
the coarse-graining was somewhat incidental. 

We follow the coarse-graining procedure first outlined 
in 0 (see Eq. 5.4), which we reproduce here in slightly 
different notation. First, we define cross-power C(t; f) = 
(2/Af)s*(t; f)§ 2 (t\ /) where Af is a Fourier normalization 
constant; see Eq. IA5I in Appendix [A] This allows us to 
rewrite Eq. [2] as j47| : 

p(t; f) = V2 C(t, f) / y/P{(t; f)P'(t ; /). (5) 
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FIG. 1: The daily modulation of p. Each panel shows a ( 5t , A/) = (158 s, 1 Hz) spectrogram consisting of two consecutive days 
of data. The data consist of Advanced LIGO 0] Monte Carlo noise plus a simulated signal at / = 100 Hz. The top-left panel 
(a) shows Re(p) while the top-right panel (b) shows Im(p). In the top panels, the injected signal, for illustrative purposes, is 
very loud: ho = 1 x 10 -22 . (Since the signals are very loud, the color bar is saturated.) At t = 0, the phase angle is in the 
lower-right quadrant of the p complex plane where Re(p) > 0 and Im(p) < 0. It rotates clockwise so that Re(p) crosses zero 
and becomes negative before Im(p) crosses zero and becomes positive. The bottom-left panel (c) shows Re [exp (—i4q) p] for a 
much weaker signal: ho = 4 x 1CP 24 . The phase factor is applied to make the signal entirely real; see Eq. IaTI The bottom-right 
panel (d) shows the recovered signal (the most significant cluster identified in the bottom-left panel) recovered with 2 x 10 6 
seedless clustering templates (see Section [V] for details). The source is located at (ra, dec) = (18.5 hr,+39°) and it is recovered 
7° degrees away at (18.9hr,+35°). The recovery algorithm does not assume that the signal is monochromatic. 


The coarse-grained cross-power is given by: 

,, /=/cg+A//2 

CcG(t-Jc G ) = E (6) 

1 /=/cg-A//2 

Here, A/ is the width of the new coarse-grained bins, 6f 
is the original fine-grained bin width, and fcG are the 
new coarse-grained frequency bins. The variable w(f) is 


a weight factor, which is unity when / ^ /cg ± A//2. 
When / = /cg ± A//2, then w(f) is some number on 
[0,1], which takes into account the fact that the edges 
of a coarse-grained bin may fall in between the edges of 
two fine grained bins. The precise value is given by the 
fraction of the fine-grained bin that overlaps with the 
coarse-grained bin. Thus, e.g., a A/ = 0.25 Hz coarse¬ 
grained bin spanning 10.0-10.25 Hz would include a w = 
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0.5 overlap with a Sf = 1 /60 Hz fine-grained bin spanning 
9.98-10.017 Hz. 

Eq. [G] can be understood as an application of Par- 
seval’s theorem. The analogous coarse-grained auto¬ 
powers, P'nn(t\ f), P^ CG (t; /) are calculated using 
Welch’s method [39]]. We thereby obtained the coarse¬ 
grained estimator: 


where if) is the angle between Ax and Ail. The loss of 
signal-to-noise ratio is (conservatively) maximized when 
cos(tp) = 1. Thus, we obtain the following expression for 
St: 


cos 1 (R) 

2i r/ t 0 (7r/24 hr) ’ 


( 11 ) 


PcG(t;f)=V2Cc G (t-J)/^/p{ CG (t;f)P^ CG (t-J). (7) 

In addition to Eq. [6l we considered an alternative 
coarse-graining procedure performed in the time domain, 
which we refer to as “hybrid compression.” Instead of 
combining many narrow frequency bins associated with 
long segments, we combined many short segments associ¬ 
ated with wide frequency bins. While the two procedures 
produced similar results, we find that the frequency- 
domain implementation reduced spectral leakage by up 
to tens of percent in power compared to the time-domain 
implementation, thereby allowing for better signal recon¬ 
struction. That said, there may be applications in which 
it it useful to carry out hybrid compression: first in the 
frequency domain and then in the time domain. We re¬ 
turn to this idea at the end of the next subsection. 


B. Limitations 


The rotation of the Earth places a limit on the extent 
of possible compression [H}. If the segment duration St 
is too long, the Earth will rotate significantly during the 
duration of the segment so that it becomes impossible to 
“point” the radiometer in different directions by weight¬ 
ing each spectrogram pixel with the appropriate phase 
factor; see El In practice, this translates to a loss of 
signal-to-noise ratio. 

Following El, the fraction of signal-to-noise ratio not 
lost due to phase offset R can be written as 

R = cos(27r/Aaf • Afi/c), (8) 


where / is gravitational-wave frequency, Ax describes the 
displacement between the two detectors, AH describes 
the angular error due to the rotation of the Earth, and c 
is the speed of light. 

The angular error depends on the segment duration 
relative to a sidereal day. The local coordinates for di¬ 
rections corresponding to the poles (dec = ±90°) do not 
change at all as the Earth rotates. The change is greatest 
for directions in the equatorial plane dec = 0°, for which 


AH 


27T 


St/2 
24 hr' 


(9) 


Here, St is the segment duration. 

If we conservatively assume that the source is located 
at dec = 0°, we can rewrite Eq. [5] as 


R = cos [27r/|Ax/c|(7T(5t/24hr) cos(^)], (10) 


where to = |Ax/c| is the travel time between the two 
detectors («10ms for the Hanford-Livingston pair). If 
we demand that R > 90%, we obtain 


St < 197 s 


1000Hz\ 

— r~) 


( 12 ) 


for the Hanford-Livingston detector pair. Employing a 
more conservative requirement of R > 95%, the prefactor 
becomes 140 s. 


C. Cost 


Now, we compare the computational cost of an anal¬ 
ysis using uncompressed data, consisting, e.g., of 50%- 
overlapping 1 Hz x 1 s spectrogram pixels, to the cost 
of an analysis using compressed data consisting of non¬ 
overlapping 1 Hz x 158 s pixels. After compression, a 
lOhr-long stretch of data is described by only 228 (non¬ 
overlapping) 158 s coarse-grained segments [49|. This 
makes it straightforward to search for ^day-long signals. 
One can analyze signals with durations in excess of one 
week e.g., by splitting the analysis band into overlapping 
sub-bands. 

Further, having reduced the size of the dataset by a fac¬ 
tor of 2(158 s)/ls ~316, an all-sky search using seedless 
clustering will run «316 times faster (than an analysis 
using uncompressed, 50%-overlapping, 1 Hz x 1 s spectro¬ 
grams). Seedless clustering uses phaseless templates to 
approximate narrowband gravitational-wave signals E3- 
The sensitivity of the search, as well as its cost, is influ¬ 
enced by the number of templates El 

The cost of a radiometer search is usually dominated 
by background estimation. The standard procedure relies 
on time slides, in which one data stream is repeatedly 
shifted in time with respect to the other in order to obtain 
many realizations of detector noise. Actual gravitational- 
wave signals do not appear in time-shifted data since the 
coherence is destroyed. In order to detect signals with 
a 3cr false alarm probability of «0.3%, it is necessary to 
analyze the ~370 realizations of time-shifted data. 

Scaling estimates from El , and assuming «3 day-long 
spectrograms, we estimate the cost of background esti¬ 
mation for a realistic search to the 3a level. We assume 
the search is carried out with T150 = 2 x 10 6 templates per 
150 Hz of bandwidth (a typical number of templates 11]). 
We find that it can be carried out using 3.6 continuously- 
running eight-core Intel Xeon E5-4650 CPUs or Kepler 
GK104 GPUs El- This translates to 29 continuously- 
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running CPU cores. The computational cost for back¬ 
ground estimation in a realistic search scales as follows: 



Here, t c is the computation time, t 0 bs is the duration 
of the observing run, B is the width of the analysis 
band, and rots is the number of time slides (in this case, 
corresponding to three-sigma confidence). The variable 
«gpu is the number of GPUs (or, equivalently, 8-core 
CPUs). The ratio of (A f/Sf) describes the compression 
achieved with coarse-graining. For the examples shown 
here, (A f/Sf) = 158. The factor of two is included if we 
replace overlapping windows with non-overlapping win¬ 
dows. 


In practice, the loss of sensitivity can be less if the 
signal is not confined to a single fine-grained bin at any 
given time. Also, fine-grained bins are sometimes con¬ 
taminated with noise artifacts including non-Gaussian 
and non-stationary noise, which are, to some extent, ame¬ 
liorated through coarse-graining. Such noise artifacts can 
therefore serve to make the loss of sensitivity less severe 
in comparison to the idealized case. 

Once the data are coarse-grained, the high-resolution 
information is lost; the process is not invertible. How¬ 
ever, in the event of a candidate detection, one can imag¬ 
ine reanalyzing the original lossless dataset with a tar¬ 
geted search, looking only in a small band and in a spe¬ 
cific direction identified by the coarse-grained search. By 
severely restricting the parameter space of the search, it 
is possible to carry out a computationally feasible follow¬ 
up study using fine-grained data with reasonable compu¬ 
tational resources. 


D. Hybrid compression 

At the end of the previous subsection, we noted that 
there may be applications for which it is helpful to 
carry out hybrid compression: first in the frequency do¬ 
main and then in the time domain. For example, if 
we are interested a particular direction in the sky, one 
can combine many frequency-domain coarse-grained seg¬ 
ments into even longer segments via Eq. |4] Since, by 
assumption, each of the frequency-domain coarse-grained 
segments in the sum already takes into account the phase 
offset between the two detectors, the constraint from 
Eq. Q2] does not apply, and so there is no limit to the 
number of segments that can be combined. In this way, it 
should be possible to represent an arbitrarily long span of 
data with a fixed number of spectrogram pixels, facilitat¬ 
ing a search for a signal of any duration. It is important 
to note, however, that this hybrid technique can only be 
applied by assuming a specific source location, and so it 
does not lend itself to an efficient all-sky search. 

E. Trade-offs 

By coarse-graining spectrographic data with Eq. 0 
we are trading reduced frequency resolution for reduced 
computational burden. As a consequence, the sensitiv¬ 
ity of the search is reduced in comparison to an ide¬ 
alized lossless search using arbitrarily large computing 
resources. To see why, consider a narrowband signal 
passing through just one fine-grained spectrogram pixel 
(width = Sf ) during each segment in time. When the 
data are coarse-grained to a width A/, the signal in this 
single fine-grained bin is averaged with data in neighbor¬ 
ing frequency bins containing only noise. In this worst- 
case scena rio, the d etection statistic SNR tot is reduced by 
a factor yjAf/SF —corresponding to a factor of ~13 for 
the 1 Hz x 158 s pixels used for our demonstration below. 


V. DEMONSTRATION 

Our goal here is to show that very long signals, lasting 
hours or longer, can be successfully identified by employ¬ 
ing an existing pattern recognition algorithm on com¬ 
pressed data. For our first demonstration, we recover the 
signal shown in Fig. QJ. We create a simulated dataset 
spanning two days of Monte Carlo data for the Advanced 
LIGO Hanford and LIGO Livingston detectors operating 
at design sensitivity. The data consists of Gaussian noise 
plus a simulated monochromatic signal with frequency 
/ = 100 Hz and strain amplitude ho = 4 x 10 -24 , located 
at (ra, dec) = (18.5 hr,+39°). The signal is circularly 
polarized. 

The data are cross-correlated to produce spectrograms 
with resolution of (5t,Af) = (158 s, 1 Hz) with non¬ 
overlapping segments. We consider an observing band 
spanning 10-300 Hz. This spectrogram (with a phase 
factor applied to point the radiometer in the appropriate 
direction) is shown in Fig. [T}:. Once the data are com¬ 
pressed, we carry out a seedless clustering search Emu. 

We make no assumptions about the start time, stop 
time, frequency, or sky location of the search—subject 
to the constraint that the signal morphology can be ap¬ 
proximated as a cubic Bezier curve of at least 1.8 hr 
in duration. (Following El, we take advantage of the 
fact that p encodes directional information as a complex 
phase in order to point each seedless template in a dif¬ 
ferent random direction.) We record SNR to t for 2 x 10 6 
randomly-generated templates in order to identify the 
template that best fits the signal. The best-fit template 
(with SNRtot = 56) is shown in Fig. [ljl. The recovery 
represents a reasonably good match to the injected sig¬ 
nal. (It would be better recovered using a larger number 
of templates.) Running on an eight-core CPU, this 48 hr 
stretch of data was analyzed in «550 s. 

As a second test, we attempted to recover a signal with 
a more complicated time-frequency evolution. The signal 
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turns on a;5 hr after the beginning of data-taking and 
ends near t »;17hr during which time the signal evolves 
according to: 


<F(t) = $ 0 + 27r 


/o (t - t 0 ) + -/ (t — t 0 ) 2 


h + = ho cos ($(f)) 
h x = h 0 sin($(t)). 


(14) 


We choose fo = 100 Hz, / = 20 Hz day -1 , and h 0 = 
4 x 10 -24 . The signal is added to simulated noise and 
processed using the same procedure used for the first test 
injection. 

In Fig. [2j.i, we show a spectrogram of p(t; /). The sig¬ 
nal can be seen as a faint track beginning at 5 hr and 
appearing to end at 12 hr. (The last several hours of the 
signal are difficult to see by eye since the alignment of the 
detectors makes it weak during this span.) The recovered 
signal is shown in Fig. [SJd. Once again, we use 2 x 10 6 
cubic Bezier templates. As before, the source is located 
at (ra, dec) = (18.5 hr, +39°), though, this information 
is not provided to the search algorithm. The recovered 
track (with SNR to t = 11) is a good match. 

In Fig. [2}c, we show the spectrogram for an even longer 
week-long signal present in two weeks of data. The signal 
parameters are identical to parameters used in Figs. [2]a,b 
except we use a smaller value of / = 10 Hz day -1 . Using 
2 x 10 6 cubic Bezier templates, we recover the injected 
signal in Fig. [2Ji with SNR to t = 67. 

In Table HI we show the strain values ho for which dif¬ 
ferent signals (such as those shown in Fig. 0 can be de¬ 
tected in 90 days of data with a false alarm probability 
of 1% and a false dismissal probability of 50%. We look 
for signals in spectrograms of duration At = 1 day, and 
in one case, At = 14 days. We simulate noise to deter¬ 
mine the threshold of SNR to t required for 1% false-alarm 
probability detection. Then, we recover simulated signals 
with varied ho to determine the value for which at least 
half of the signals are recovered with a sufficiently large 
value of SNRtot (false dismissal probability = 50%). In 
all cases, we assume an optimally oriented source with 
an optimal sky location. (For randomly oriented sources 
with random sky locations, we expect ho to increase by 
a factor of wl.5-2; see, e.g., (Tlj| A We employ 2 x 10 5 
templates in a 300 Hz bandwidth. 

The results shown in Table Q] can be understood in 
terms of a simple scaling law, which can be used to guide 
our expectations for the detectability of an arbitrary sig¬ 
nal. If the seedless clustering template approximately 
matches the signal we are trying to recover, then the de¬ 
tection statistic scales approximately as follows: 

<15) 

where P(f) is the strain noise power spectral density av¬ 
eraged over the emission frequency and tdur is the du¬ 
ration of emission. While to ur , A/, and P(f) are the 


fo (Hz) 

/ (Hz day 4 ) 

tdur (days) 

At (days) 

ho 

100 

20 

0.29 

1 

3.6 x 10 -24 

100 

0 

0.29 

1 

3.3 x 10 -24 

100 

10 

7 

14 

1.7 x 10 -24 

1000 

0 

0.29 

1 

6.2 x 10 -24 


TABLE I: The detectability of different signals in simu¬ 
lated Advanced LIGO noise. The variables fo and / de¬ 
scribe the signal morphology following Eq. [14] while ta ur is 
the signal duration. At is the duration of each spectrogram 
input to the seedless clustering algorithm. The variable ho is 
the signal strength that can be recovered with a false alarm 
probability of 1% and a false dismissal probability of 50% 
in 90 days of data. We assume optimally oriented sources 
in optimal sky locations. We employ 2 x 10 5 templates per 
spectrogram and analyze data in a 300 Hz bandwidth. We use 
(<5t, A/) = (158 s, 1 Hz) pixels. The sensitivities quoted here 
can very likely be improved by using more templates. 


most important factors in determining the detectability 
of a signal, other variables play a role too. For example, 
using a fixed number of templates, it is harder to detect 
signals as the frequency increases (all else equal) because 
the shrinking diffraction-limited resolution makes it more 
challenging to match the signal to a template [lT]. 

Together, the recoveries demonstrated in Figs. Q] and [2] 
and also Tab. [fl show that it is possible to recover very 
long-lived signals with a range of different signal mor¬ 
phologies in compressed cross-correlated data. The re¬ 
covery works for relatively weak signals and can be car¬ 
ried out with reasonable computational resources. It is 
made feasible by coarse-graining, which allows us to an¬ 
alyze days of data using a single spectrogram. 

In order to place the results of Tab. |T| in context, we 
consider an illustrative emission scenario. We consider a 
neutron star in a binary, which emits gravitational waves, 
for example, through r-mode instabilities, which decay on 
a timescale of a sal week. By assuming accretion-torque 
balance, it has been hypothesized M that the low-mass 
X-ray binary Scorpius X-l emits gravitational waves with 
a strain of ~9 x 10 -26 at / = 100 Hz [4C|. By tuning the 
bin width to match the observed Doppler modulation of 
Scorpius X-l, we expect the sensitivity stated in Tab. U 
could be improved by a factor of (1 Hz/0.05 Hz) 1 / 4 ps 
2 near 100 Hz [501]. This is still about a factor of 10 
below the expected strain from Scorpius X-l assuming 
accretion-torque balance. 

Thus, assuming the hypothesized accretion-torque bal¬ 
ance scenario, advanced detectors, using the method 
presented here, are unlikely to detect very long tran¬ 
sient gravitational waves from a source like Scorpius X-l 
(d = 2.3 kpc). Detection would require a source that is 
either significantly closer or significantly brighter. That 
said, it is important to note that detecting any gravi¬ 
tational waves from neutron stars such as Scorpius X-l 
is a challenging proposition for a variety of search tech¬ 
niques [4l| . The method proposed here will help to cover 
the full parameter space of possible signals from such 


















FIG. 2: Recovery of a very long transient signal with seedless clustering. Top-left (a): a (St, A/) = (158 s, 1 Hz) spectrogram of 
p(t; /|il) (Ea. lA3l) consisting of one day of data. The data consist of Advanced LIGO Monte Carlo noise plus a simulated signal. 
The signal begins at / = 100 Hz and increases with a spin-up given by / = 20Hzday _1 . The amplitude is ho = 4 x 10 -24 . 
The signal persists between t «5-17hr. (It grows very faint after t «12hr due the unfavorable orientation of the Earth during 
this period.) Top-right (b): the recovered signal recovered with 2 x 10 6 cubic Bezier templates; SNRtot = 11. Bottom-left (c): 
a (St, A/) = (158 s, 1 Hz) spectrogram of p(t; /|11) consisting of two weeks of data. The data consist of Advanced LIGO Monte 
Carlo noise plus a simulated signal. The signal begins at / = 100 Hz and increases with a spin-up given by f = lOHzday^ 1 . 
The amplitude is ho = 4 x 1CP 24 . The signal persists between t «3-10days. Bottom-right (d): the recovered signal the most 
significant cluster identified in the bottom-left panel) recovered with 2 x 10 6 cubic Bezier templates; SNR to t = 67. 


sources. 

Before we move on, we note in passing, an interest¬ 
ing difference between this very long transient algorithm 
and the matched filtering algorithm proposed in fl3| . 
In [T 3 J, the authors note that—at a fixed frequency— 
the matched filtering signal-to-noise ratio depends only 
on the total energy emitted in gravitational waves E gw 
and not on the signal duration <dur- However, since radio- 


metric signal-to-noise ratio scales like SNR oc \/tdur h$, 
and since E gw oc blur h'o, SNR oc £ , gw /v / biur- Thus, for 
radiometer analyses, we obtain a different scaling behav¬ 
ior than the matched filter behavior noted in [131 ] : given 
a fixed energy budget, shorter gravitational-wave tran¬ 
sients are easier to detect than longer transients. Given 
a constant flux <f> gw oc dE gv ,/dt, the signal-to-noise ratio 
grows with time: SNR oc <b gw y/tdur- 
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VI. CONCLUSIONS 

We have demonstrated the machinery of an efficient 
search for very long lived transient gravitational waves. 
Such a search will be useful, both looking for very long 
transient signals from neutron stars, and for understand¬ 
ing the time-dependence of signals identified through per¬ 
sistent radiometer searches and stochastic background 
searches. Our procedure relies on coarse-graining to cre¬ 
ate an intermediate data product that is computationally 
manageable to work with. In carrying out our demon¬ 
stration, we explored the modulations in cross-correlated 
data due to the rotation of the Earth. Future work, build¬ 
ing on this demonstration, will explore new physics (and 
instrumental effects) on timescales of hours to days. 

We note that the method described in 0 , which 
builds on existing searches for continuous waves from 
persistent emitters, can likely achieve better sensitivity 
than the method proposed here in its domain of util¬ 
ity: sources following the canonical spin evolution typi¬ 
cal of an isolated pulsar. This is because continuous wave 
searches for isolated neutron stars (see, e.g., HU) assume 
a heavily constrained parameter space of pulsar-like sig¬ 
nals. However, there are important differences between 
radiometry and continuous waves searches, which make 
our new method highly complementary. 

Like all applications of gravitational-wave radiome¬ 
try, the method described here is most useful in cases 
when the signal does not match a simple template. For 
neutron stars in binary systems, radiometer measure¬ 
ments HU HU can be comparable in sensitivity to contin¬ 
uous wave techniques HU , which rely on a more de¬ 
tailed signal model. Further, radiometer searches, includ¬ 
ing the one described here, are extremely robust because 
they do not make assumptions about the phase evolu¬ 
tion of the signal. Finally, in the method described here, 
we allow for significant variation in the frequency evolu¬ 
tion of the gravitational-wave signal, which allows us to 
probe a completely different parameter space compared 
to continuous wave searches, which assume very limited 
changes in the neutron star spin: <6x 10~ 9 Hzs _1 ver¬ 
sus < 300Hzs _1 . 

While we have focused here on signals lasting hours 
to weeks, this method can be extended to target both 
shorter and longer signals as well. While significant work 
has been carried out to develop tools for probing signals 
with durations of 10-1000 s, it may prove useful to apply 
some modest coarse-graining (A f/Sf > 1) to searches 
that are stretched for computational resources. Charac¬ 
terization of Riyear-long signals may require some addi¬ 
tional development since a year-long spectrogram of com¬ 
pressed data still requires significant memory resources. 
However, extrapolating from our analysis here of a two- 
week-long spectrogram, it seems likely that such a search 
could be carried out through a number of options such 
as the use of computers with larger amounts of memory 
>24 GB and/or working with narrower frequency bands. 

Another next step is to consider how realistic searches, 


using the ideas presented here, can be optimized. For 
example, a search for persistent narrowband signals may 
benefit from the use of overlapping frequency bins and/or 
frequency bins of different sizes. Such optimizations 
would be equally useful for any radiometer search. 
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Appendix A: Details of radiometry 


1. Definitions 


Given two simultaneous strain measurements si and 
S 2 , the cross-correlation statistic Y (and its associated 
uncertainty a) can be written as (T) : 


Y(f;/|H) = -Re 


3 2 irifCl-Ax/c 


l 


ei2(t|U) 


s*(£; f)s2{t; f) 


a 2 (t-J\n) = \ 


1 

ei2(f |H) 




(Al) 


Here, s/(t; /) is the discrete Fourier transform of the 
strain series from detector / and P'(t;f) is the auto¬ 
power spectrum calculated using neighboring data seg¬ 
ments. Each segment has duration St. The argument 
t is the starting time of the data segment and / is fre¬ 
quency. M is a Fourier normalization coefficient, f 1 is the 
direction of the source, Ax is the difference in detector 
position, and c is the speed of light. Above, we carry 
out simulations for the Advanced LIGO network (con¬ 
sisting of detectors in Hanford, WA and Livingston, LA) 
for which |Ax| /c ~ 10ms. The eiA/IH) factor describes 
the efficiency of the detector pair [L|: 

e 12 m = -j2 FfmFfm. (A2) 

A 

Here, Ff(t\Cl) is the antenna factor HU for detector I 
and A = +, x are the different polarization states. 
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It is useful to define “pixel signal-to-noise ratio”: 


vary significantly in frequency, then we can write 


= (A3) 

We use p(t\ f\Cl) for its convenience as a tool for visual 
representation of cross-correlated data and also because 
it allows for compact derivations. In the context of all¬ 
sky searches, it is also useful to define “complex signal- 
to-noise ratio” EH 


Yi = pi<ro/e\ 


(A7) 


— & o / 612- 

Recall that e\ 2 (Eq. IA2I) is an efficiency factor arising 
from the detector response function, which changes in 
time as the Earth rotates. The variable do is some num¬ 
ber determined only by the detector strain sensitivities. 
Combining Eas. IA6l and lA7l we obtain: 


p{t-f) = k I f), (A4) 

where, for the sake of compactness, we define a normal¬ 
ization factor 


k = 2V2/JV. (A5) 

Comparing Eci. lA3l and lA4[ we see that p(t; /) is the same 
as p(t\ f) except no phase factor has been applied, and 
we do not take the real part. In doing so, we obtain a 
data product with information from all directions on the 
sky. 


2. Clustering 


Having defined spectrograms of p or p (Eqs. IA3I 
and IA4I1 . the next step in a radiometer analysis is to 
look for clusters of statistically significant excess coher¬ 
ence. Details of the clustering procedure depend on the 
specific search, but there are certain common features. 

First, a rule is defined for how clusters can be formed. 
In the case of the targeted narrowband radiometer El, 
[34l . |38| , the signal is assumed to persist with a fixed fre¬ 
quency, and so each cluster corresponds to the set of all 
pixels at a given frequency; i.e., one spectrogram row. 

Once a cluster is defined, all the pixels in that cluster 
are combined in order to produce a detection statistic 
for that cluster denoted SNR tl0t . The detection statistic 
(for a targeted search with a known sky location) can be 
written as as a weighted average EH, 0, 38): 


SNR 


tot 


Eigr Yj Qj 2 

(E, er <V 2 ) 1/2 ' 


(A6) 


(We have suppressed the dependence on H for the sake 
of brevity.) The sum over i runs over some set of (f; /) 
pixels T, which is determined by details of the search. 
A search for persistent narrowband signals, for example, 
might sum over time at a fixed frequency. In searches for 
transient signals, the set of (£; /) p ixels may be described, 
e.g., by a Bezier curve il0lElll37j. 

If we assume for the sake of simplicity that the noise 
is approximately stationary, and that the signal does not 


SNR tot 


Eigr Pi e 12 

(Eigrfe) 2 ) 172 ' 


(A8) 


3. All-sky search 


To this point in the appendix, we have assumed that 
the sky location fl is known a priori. This assumption 
holds for targeted searches, e.g., searching for gravita¬ 
tional waves from the low-mass X-ray binary, Scorpius 
X-l dij. When the sky location is not known, the search 
becomes more complicated. To see this, consider an all¬ 
sky search which considers some finite set of directions 
| fit |. If we look in some direction 0,, which is offset 

from the true source direction fit, then the phase factor 
in Eq. [ATI will not rotate the signal power perfectly into 
the real number axis. As a result, signal power leaks into 
the imaginary direction, and can even produce negative 
power on the real axis. 

In order to carry out an efficient all-sky search, we can 
use Eqs. IA8I and IA4I to define a detection statistic that 
can quickly scan over sky locations: 


SNRtot (n) = 


Re [Eigr ex P H’J'i) pi 4s 


GCjgr( e i 2 ) 2 ) 


1/2 


(A9) 


(This definition of SNRtot is essentially the same one 
that is proposed in EH .) Graphically, Eq. IA9I can be 
interpreted as applying an B-dependent phase array to 
a complex-valued spectrogram (as in Figs. |T]i and CD 3 ) 
before summing over clusters of pixels. 

At first glance, Ea. lA9l mav appear tautological. There 
is already implicit dependence on H in Eq. IA8I hiding in 
Pi, ej, and therefore in SNR to t- However, the ingredients 
used in Eq. IA8I are calculated for a specific sky location 
B in such a way that the information about all other sky 
locations is lost when the imaginary part is discarded 
in Eq. IAT1 Thus, the formulation described by Eq. IA9I 
allows us to search the entire sky with a single data prod¬ 
uct. Using parallel computing architecture, we can effi¬ 
ciently search many directions in parallel with a single 
manageable array, dramatically speeding up calculations 
in the process [DJ . 
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